Brief Announcement: Object Replication Degree Customization for High Availability∗

نویسندگان

  • Ming Zhong
  • Kai Shen
  • Joel Seiferas
چکیده

Object replication is commonly employed to enhance the availability of data-intensive services. As far we we know, existing availability-oriented replication schemes are oblivious to object request popularities when determining object replication degrees. However, many large-scale data-intensive applications contain objects with highly skewed data object request popularity distributions. Such nonuniform popularities may be exploited to improve system availability under a given space constraint. Intuitively, using more replicas for popular objects (less for unpopular ones) can increase the overall expected service availability while keeping the total space cost unchanged. Here by expected service availability, we mean the proportion of successful service requests (whose requested objects are available) in all requests. Using a simplistic model, we propose a novel replication degree policy that maximizes the availability for systems with known object popularities and sizes. Specifically, we find that the optimal replication degree for each object i should be linear in log ri si , where ri is the normalized object popularity and si is the normalized object size. Our evaluation uses applications driven by large real-life system data object request traces and failure traces. Results show that our proposed customization can achieve significant system availability increase over uniform replication.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Data Availability Using Combined Replication Strategy in Cloud Environment

As grow as the data-intensive applications in cloud computing day after day, data popularity in this environment becomes critical and important. Hence to improve data availability and efficient accesses to popular data, replication algorithms are now widely used in distributed systems. However, most of them only replicate the static number of replicas on some requested chosen sites and it is ob...

متن کامل

An Analysis of Replication Enhancement for a High Availability Cluster

In this paper, we analyze a technique for building a high-availability (HA) cluster system. We propose what we have termed the ‘Selective Replication Manager (SRM),’ which improves the throughput performance and reduces the latency of disk devices by means of a Distributed Replicated Block Device (DRBD), which is integrated in the recent Linux Kernel (version 2.6.33 or higher) and that still pr...

متن کامل

Reliability and Availability Improvement in Economic Data Grid Environment Based On Clustering Approach

Abstract - One of the important problems in grid environments is data replication in grid sites. Reliability and availability of data replication in some cases is considered low. To separate sites with high reliability and high availability of sites with low availability and low reliability, clustering can be used. In this study, the data grid dynamically evaluate and predict the condition of t...

متن کامل

Realize: Resource Management for Soft Real-Time Distributed Systems

The Realize system simplifies the development of complex applications by separating the application programming from the management of resources for soft real-time CORBA applications, and from the replication of CORBA objects to provide high availability and fault tolerance. Realize uses totally ordered multicast messages to maintain consistency of the states of the object replicas, and adjusts...

متن کامل

Brief Announcement: Optimal Atomic Broadcast and Multicast Algorithms for Wide Area Networks∗

Distributed applications spanning multiple geographical locations have become common in recent years. Typically, each geographical site, or group, hosts an arbitrarily large number of processes connected through high-end local links; a few groups exist, interconnected through high-latency communication links. As a consequence, communication among processes in the same group is cheap and fast; c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007